Efficiently Identifying Exploratory Rules' Significance
Authors
Abstract
How to efficiently discard potentially uninteresting rules in exploratory rule discovery is one of the important research foci in data mining. Many researchers have presented algorithms that automatically remove potentially uninteresting rules using background knowledge and user-specified constraints. Applying a significance test to exploratory rules is desirable because it removes rules that appear interesting only by chance, giving users a more compact set of resulting rules. However, applying statistical tests to identify significant rules requires considerable computation and data access to obtain the necessary statistics, and the situation gets worse as the size of the database increases. In this paper, we propose two approaches for improving the efficiency of significant exploratory rule discovery. We also evaluate their effect experimentally in impact rule discovery, which is suitable for discovering exploratory rules in very large, dense databases.
Similar resources
Identifying the Factors Affecting the Successful Implementation of the Policy of Supporting Knowledge-Based Companies and Institutes and the Commercialization of Innovations and Inventions
The present study is carried out to identify the effective factors in the successful implementation of “Support for knowledge-based companies and institutes and commercialization of innovation and inventions”, enacted by the Parliament. Therefore, while identifying the effective factors, the significance and function of each factor in the successful implementation of the above-menti...
Discarding Insignificant Rules during Impact Rule Discovery in Large, Dense Databases
Considerable progress has been made on how to reduce the number of spurious exploratory rules with quantitative attributes. However, little has been done for rules with undiscretized quantitative attributes. It is argued that propositional rules cannot effectively describe the interactions between quantitative and qualitative attributes. Aumann and Lindell proposed quantitative association rul...
An Exploratory Survey of Hadoop Log Analysis Tools
In view of the fact that clusters used in large scale computing are on the rise, ensuring the wellbeing of these clusters is of paramount significance. This highlights the importance of supervising and monitoring the cluster. In this regard, many tools have been contributed that can efficiently monitor the Hadoop cluster. The majority of these tools congregates necessary information from each o...
Predictive Top-Down Knowledge Improves Neural Exploratory Bottom-Up Clustering
In this paper, we explore the hypothesis that integrating symbolic top-down knowledge into text vector representations can improve neural exploratory bottom-up representations for text clustering. By extracting semantic rules from WordNet, terms with similar concepts are substituted with a more general term, the hypernym. This hypernym semantic relationship supplements the neural model in docum...
THE SIGNIFICANCE OF JEEP TAG: On Player-Imposed Rules in Video Games
Video games, unlike traditional, non-digital games, are based on a combination of fixed rules which cannot be broken from the player position, and implied rules which are not enforced by the computer program. It is relatively common, however, for players to impose additional or alternative rules on video games, in order to refine or expand gameplay and to create new gaming experiences. This pap...